Information-Disturbance Tradeoff in Quantum State Discrimination 
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When discriminating between two pure quantum states, there exists a quantitative tradeoff be- 
tween the information retrieved by the measurement and the disturbance caused on the unknown 
state. We derive the optimal tradeoff and provide the corresponding quantum measurement. Such an 
optimal measurement smoothly interpolates between the two limiting cases of maximal information 
extraction and no measurement at all. 
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I. INTRODUCTION 

The problem of discriminating between two different 
quantum states reveals two main features that make 
Quantum Theory so much different from the classical 
intuition. First, quantum state discrimination involves 
in-principle indistinguishability of quantum states: it is 
well known that it is not possible to perfectly infer (by 
means of a one-shot experiment) which state we even- 
tually picked at random from a set of iron orthogonal 
quantum states. Of course, it is nonetheless possible to 
perform such a decision in an optimal way, e. g., by min- 
imizing the error probability of discrimination [3, by a 
minimax strategy where the smallest of the probabili- 
ties of correct detection is maximized j!2| , or looking for 
optimal unambiguous discrimination [3(, where unambi- 
guity is paid by the possibility of getting inconclusive 
results from the measurement. Second, quantum indis- 
tinguishability principle is closely related to another very 
popular — yet often misunderstood — -principle (formerly 
known as Heisenberg principle 0, H, S] ) : it is not possi- 
ble to extract information from a quantum system with- 
out perturbing it somehow. In fact, if the experimenter 
could gather information about an unknown quantum 
state without disturbing it at all, even if such information 
is partial, by performing further non-disturbing measure- 
ments on the same system, he could finally determine the 
state, in contradiction with the indistinguishability prin- 
ciple 0. 

Actually, there exists a precise tradeoff between the 
amount of information extracted from a quantum mea- 
surement and the amount of disturbance caused on the 
system, analogous to Heisenberg relations holding in the 
preparation procedure of a quantum state. Quantitative 
derivations of such a tradeoff have been obtained in the 
scenario of quantum state estimation [H, Q . The optimal 
tradeoff has been derived in the following cases: in esti- 
mating a single copy of an unknown pure state Q , many 
copies of identically prepared pure qubits [l(| and qudits 
a single copy of a pure state generated by indepen- 
dent phase-shifts [13], an unknown maximally entangled 
state [lH, an unknown coherent state [HI and Gaussian 



state [15j . Experiment realization of minimal disturbance 
measurements has been also reported [3 [l(| . 

The present paper aims at fully characterize such a 
tradeoff relation in quantum state discrimination, in the 
case in which the unknown quantum state is chosen with 
equal a priori probability from a set of two non or- 
thogonal pure states, and the error probability of the 
discrimination is allowed to be suboptimal (thus intu- 
itively causing less disturbance with respect to the op- 
timal discrimination). We explicitly provide a measur- 
ing strategy — both in terms of outcome probabilities 
and state-reduction — that achieves the optimal tradeoff, 
which smoothly interpolates between the two limiting 
cases of maximal information extraction and no mea- 
surement at all. As a byproduct, we also recover in a 
simpler way some of the results of Ref. @ . Our explicit 
derivation of the quantum measurement should allow to 
carry out a feasibility study for the experimental realiza- 
tion of minimal-disturbing measurements. The issue of 
the information-disturbance tradeoff for state discrimina- 
tion can become of practical relevance for posing general 
limits in information eavesdropping and for analyzing se- 
curity of quantum cryptographic communications. 

The paper is organized as follows. In Sec. II, we 
briefly review the problem of minimum-error state dis- 
crimination, and obtain the minimum disturbance for the 
minimum-error measurement. In Sec. Ill, we provide the 
general solution of the optimal information-disturbance 
tradeoff, along with the corresponding measurement in- 
strument. In Sec. IV, we suggest an experimental re- 
alization of the minimum-disturbing measurement and 
conclude the paper with closing remarks. 



II. MINIMUM DISTURBANCE FOR 
MINIMUM-ERROR STATE DISCRIMINATION 

Typically, in quantum state discrimination we are 
given two (fixed) non orthogonal pure states ip\ and ip2, 
with a priori probabilities p\ and P2 = 1 — pi, and 
we want to construct a measurement discriminating be- 
tween the two. In the following, in order to work in 
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full generality, we will describe a measurement by means 
of the quantum instruments formalism [13], namely, a 
collection of completely positive maps {Si}, labelled by 
the measurement outcomes By exploiting the well 
known Kraus decomposition [18l |. one can always write 

£i(p) = Sfc-^fc . In the case the sum comprises 

just one term, namely, £i(p) = EipEj, the map £i is 
called pure, since it maps pure states into pure states. 
The trace Tt[£i{p)] = TrTLp], where IU = Efe-^f^f 
is a positive operator associated to the z-th outcome, pro- 
vides the probability that the measurement performed on 
a quantum system described by the density matrix p gives 
the z-th outcome. The posterior (or reduced) state after 
the measurement is given by pi = £i{p)/ Tr[£i(p)]. The 
averaged reduced state — coming from ignoring the mea- 
surement outcome — is simply obtained using the trace- 
preserving map £ = ^2 i £i- The trace-preservation 
constraint for £ implies that the set of positive opera- 
tors {IT} is actually a positive operator- valued measure 
(POVM), satisfying the completeness condition ^\ IT = 
1. 

Quantum state discrimination is then performed by 
a two-outcome instrument {£1,82} whose capability of 
discriminating between and ip2 can be evaluated by 
the average success probability 

2 

p({£ 1 ,£ 2 }) = Y J p^m\4^){^\)] 

*t (i) 

= ^p i Tr[n i |^)(^|]. 

Notice that P actually depends only on the POVM {IT}. 
The probability P quantifies the amount of information 
that the instrument {£i,£2} is able to extract from the 
ensemble {p\, ipi]p2,ip2}- Among all instruments achiev- 
ing average success probability P (the bar over P means 
that we fix the value of P), we are interested in those 
minimizing the average disturbance caused on the un- 
known state, that we evaluate in terms of average fidelity, 
namely, 

2 

D({£ 1 ,£ 2 },P) = 1 - Y,Pi(i>i\£(\iPi)&i\)Wi). (2) 

i=l 

Differently from P, the disturbance D strongly depends 
on the particular form of the instrument {£i}- This 
means that there exist many different instruments achiev- 
ing the same P, but giving different values of D. Let 

D{P)= min D({£ U £ 2 },P) (3) 
{£1^2} 

be the disturbance produced by the least disturbing in- 
strument that discriminates ipi from ip2 with average suc- 
cess probability P. Intuitive arguments suggest that the 
larger is P, the larger must correspondingly be D (i. c., 
the larger is the amount of information extracted, the 



larger is the disturbance caused by the measurement). 
Our aim is to quantitatively derive such a tradeoff, along 
with the corresponding measurement instrument. From 
now on we will restrict to the case of equal a priori prob- 
abilities, i. e. pi = P2 = 1/2. 

Let us start reviewing the case of the measurement 
maximizing P. First of all notice that, given two gener- 
ally non orthogonal pure states ipi and ip 2 , it is always 
possible to choose an orthonormal basis {|1), |2)}, placed 
symmetrically around ipi and ip2 (see Fig. [1]) , on which 
both states have real components, namely 

\tpi) = cos a |1) + sin a |2), 
IV'2) = sin a |1) + cos a |2), 

and fidelity / = K'^ilV'a)] = sin 2a. In this case, it is 
known jl] that the maximum achievable P is 

P op t = cos 2 a, (5) 

which is obtained by the orthogonal von Neumann mea- 
surement {|1)(1|, |2)(2|}. 




FIG. 1: Helstrom's scheme to optimally discriminate between 
to non orthogonal states ipi and ip2. The orthogonal axes 
1 and 2 correspond to the von Neumann measurement that 
achieves the optimal discrimination probability (5). Accord- 
ing to the measurement outcome, ipi and tp2 are the states to 
be prepared, in order to minimize the disturbance, see Eq. 

Which is the instrument, among all instruments 
achieving P op t, that minimizes the disturbance Dl Let 
us assume for the moment (the optimality of this assump- 
tion will be proved in full generality in the second part of 
the paper) that such an instrument is pure. Intuitively, 
this means that we are excluding a classical shuffling of 
outcomes. Then, since P op t is reached by a rank-one von 
Neumann measurement, we can write 

£ i (p) = U i \i)(i\p\i)(i\Uj > i = l,2, (6) 

where Ui is a unitary operator. Letting Ui\i) = one 
recognizes in Eq. (|6|) a measure-and-prcpare realization: 
the observable \i)(i\ is measured and, depending on the 
outcome, the quantum state ipi is prepared, i. e. one 
has £i(p) = \ipi)(ipi\Tr[p\i)(i\]. By symmetry arguments 
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(under the label exchange "1" <-> "2"). U\ = U\, namely, 
the ipi's are symmetrically tilted with respect to the ipi's, 
see Fig. [1] With this notation, D can be rewritten as 



1 2 

D(p opt ) = i--J2 \(m\ 2 m\i> 3 



(7) 



Since |(i|^j)| 2 = cos 2 a, \(^ i \'tjji)\ 2 = cos 2 (/3— a), and, for 

* ^ 3, = sin 2 a, and |(^i|%}| 2 = sin 2 (a + 0), 

minimizing the disturbance ((?]) resorts to minimizing the 
following function D{(3) of the tilt (3 

D{0) = 1- cos 2 a cos 2 (f3 -a)- sin 2 a sin 2 (a + (3), (8) 

where a is a parameter, fixed along with the input states. 
Solving the equation dD((3)/ d/3 = 0, it turns out that 
the tilt /3 minimizing the disturbance is related to the 
angle a by 



tan 2/3 



tan 2a 
cos 2a ' 



(9) 



in agreement with Ref. [5|]. From the above equation, 
(3 > a. The presence of the tilt (3 can be geometrically 
explained starting from the observation that, for non or- 
thogonal states, minimum error discrimination can never 
be error-free. In other words, even using the optimal 
Helstrom's measurement, there is always a non zero er- 
ror probability, and, the closer the input states are to 
each other, the smaller the success probability is. Hence 
it is reasonable that, the closer the input states are, the 
less "trustworthy" the measurement outcome is, and the 
average disturbance is minimized by cautiously preparing 
a new state that actually is a coherent superpositions of 
both hypotheses tpi and Using Eq. (J9J), from Eq. ([8]) 
one obtains the minimum disturbance for Helstrom's op- 
timal measurement 



4 - v / 14 + 2cos8a 



(10) 



Notice that -D pt reaches its maximum for a = tt/8, 
namely, when ipx and ip2 are "unbiased" with respect 
to each other (K^il^)! 2 = 1/2). 



III. THE GENERAL SOLUTION 

We analysed the limiting case in which the informa- 
tion extraction is maximized — i. e. the average success 
probability is maximized. The opposite limiting case is 
when we do not perform any measurement at all, with- 
out disturbing the states. The main result of the paper 
is to provide the optimal tradeoff for all intermediate sit- 
uations, along with the corresponding quantum instru- 
ment. In order to do this, it is useful to exploit the 
Choi-Jamiolkowski isomorphism [l9| between completely 
positive maps M. on states on 7i and positive operators 
R M on n^H 



M 



r m = (M®i)\n){Q.\ 



(11) 



where = X)fe=i 1^) ® 1^) ^ s the (non normalized) max- 
imally entangled vector in the d 2 -dimensional Hilbcrt 
space Ji ® TL (in our case, H is two-dimensional). The 
correspondence pip is one-to-one, the inverse formula 
being 



M(p) = Tr 2 [(l®p*) R M ], 



(12) 



where Tr 2 denotes the trace over the second Hilbcrt 
space, and p* is the complex conjugated of p, with re- 
spect to the basis fixed by |f2) in Eq. (fTTj) . In terms 
of Choi-Jamiolkowski operator, trace-preservation condi- 
tion is given by Tri [Rm] = 1- 

An instrument {£±,£2} can then be put in corre- 
spondence with a set of positive operators {i?!,i? 2 }. 
Clearly, < Tri[iZi] < 1 and < T ri [i? 2 ] < 1, while 
Tri[i?i + R2] = 1, since the total operator R = R% + i? 2 
corresponds to the trace- preserving map £ = £1 + £ 2 . 
The average success probability ([1]) and the average dis- 
turbance @ can be rewritten as 



1 



(13) 



(=1 



1 2 

d = 1 -r^MWW ® mmi r], (14) 



respectively. (In the following, we will drop the star, 
since ipi and '02 have real components over the basis 
{|1),|2)}.) Our strategy is to fix the average success 
probability 1/2 < P t < P opt by fixing the value of a 
control parameter t, i. e. 



Pt = tPo 



1 -t 



pt 



= t cos a 



(15) 



with < t < 1, and then to search, among all possible 
measurements achieving P t , for the one minimizing the 
disturbance D(P t ). In the symmetric case, p\ = p 2 = 
1/2, the minimization problem can be strikingly simpli- 
fied by exploiting the exchange symmetry |?/>i) = Pxlfo), 
'O 1 N 



where a , 



1 



in the {|1), |2)} basis. It is then sim- 



ple to check that, given an instrument {Ri, R2} achieving 
average success probability P and disturbance D, the in- 
strument {R[,R' 2 } constructed as 



Ri = ^(Ri 



! R j0 - 



x 2 



)i * ^ 3, 



(16) 



achieves the same values of P and D as well. Hence, 
without loss of generality, we can restrict ourselves to 
instruments satisfying i? 2 = a® 2 R\cr® 2 . Then, the av- 
erage disturbance (| 14[) can be rewritten as D = 1 — 
Tr[E i?i], where E = £\ \A) (A\® 2 , and the optimiza- 
tion problem ([3|) over a two-outcome instrument resorts 
to the following — much simpler — optimization over a sin- 
gle positive operator R\ 

D t = min D(P t ) = 1 - maxTrfE (17) 

Ri 
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with the trace-preservation constraint Tri[i?i + 
af 2 Riaf 2 } — 1, and the constraint of av- 
erage success probability equal to Pt, namely 
Tr[(l (g) \ipi)(ipi\) Ri] = Pt- These constraints can 
be recast as four linear conditions: 



Ri > 0, 
Tr[(l®<r x ) Rx] 

'l 



0, 



Ti[R r ] = 1, 
Trp<g><x z ) Rj] 



(18) 



2P t - 1 
cos 2a 



where a 7 = 



-1 



Basic linear programming meth- 



ods, along with the reconstruction formula (fT2"l) . show 
that, for every value of Pj, the minimum disturbance D t 
is achieved by the pure instrument Sf ] {p) = Ef ] pE^ m , 
where 



E[ t] = U(t) 



ft 



E. 



(/) 



= U\t) (- 



yT+7 . 
2 



(19) 



with 7 = vl — £ 2 . The unitary operator f/(£) in the 
above equation generalizes that in Eq. ^ as follows 



where [2C 



E/(i) 



tan2/3 t 



cos /3 t sin p t 
— sin /3 t cos p t 



t sin 2a 
cos 2 2a + 7 sin 2 2a 



(20) 



(21) 



It follows that every instrument that achieves average 
success probability P t must cause at least an average dis- 
turbance 



D t = - (1 -£sin2asin2/3t) 



cos 2p t 



[7(cos4a — 1) — cos 4a — 1] 



It is also simple to check that 



n 



it) 



E m E {t) 



t\i}(i\ 



1-t 



■1, 



(22) 



(23) 



namely, the POVM of the measurement is the convex 
mixture of the optimal one {|1)(1|, |2)(2|} and a com- 
pletely random one. On the contrary, the instrument 
operators (|19p represents a coherent superposition of Hcl- 
strom's (see Eq. ((6])) and the identity map. 

Just by varying the control parameter t, it is possible 
to smoothly move between the limiting cases. For t = 0, 
we obtain the identity map, that is, the no-measurement 
case. For t = 1, we obtain Helstrom's instrument in 
Eq. ([6]), thus proving that assuming pure instruments is 
in fact the optimal choice. In particular, Eq. (|21[) pro- 
vides the tilt given in Eq. ([9]). However, the crucial differ- 
ence between Helstrom's limit (t = 1) and the intermedi- 
ate cases is that, for t < 1, the optimal instrument can- 
not be interpreted by means of a measure-and-prepare 



scheme, and the unitaries U(t) and U^it) in Eq. (fT9]) 
represent feedback rotations for outcomes 1 and 2. 

By eliminating the parameter t from Eqs. (|15[) 
and (|2"2"j) . we obtain the optimal tradeoff D(P) between 
information and disturbance, for any value of a, namely 
for any couple of states with fidelity / = sin 2a. We 
plot D(P) in Fig. [2] for three different values of /. i.e. 
P 



3 1 
4' 2' 




0.6 0.7 



0.9 1 



FIG. 2: Optimal tradeoff between disturbance and discrim- 
ination probability of two pure states i/ii and tf)2 for three 
different values of the fidelity / = |(V>i|^2}|, i.e. f 2 = |, 5, \. 
The convolution of ending points (the dashed curve) provides 
the minimal disturbance for Helstrom's optimal measurement, 
namely D opt of Eq. (|10[l , with P = cos 2 a. 

The expression of D(P) is rather involved, however 
it can be simplified upon introducing the renormalized 
quantities J" and 3 as follows 



J = 



P - 1/2 



D 



P 



opt 



1/2' 



D, 



(24) 



opt 



where P opt and D opt are given in Eqs. ([5]) and (flUf . 
respectively. Clearly, one has < J?,2t < 1. After 
some lengthy algebra, we recover the following result of 
Ref. without any assumption: the optimal tradeoff 
between the amount of information .y retrieved from the 
measurement and the disturbance & caused on the state 
is given by 



D opt 9(l - D, 



sin(4a) 



opt- 



(l - v 7 ! - ^ 2 ) ■ (25) 



For an optimal instrument, equality (|25p holds for any 
value of a, whereas for any suboptimal instrument the 
l.h.s is strictly larger than the r.h.s. 



IV. CONCLUSION 

In conclusion, a tight bound between the probability 
of discriminating two pure quantum states and the de- 
gree the initial state has to be changed by a quantum 
measurement has been derived. Such a bound can be 
achieved by a noisy measurement instrument, where the 
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noise continuously controls the tradeoff between the in- 
formation retrieved by the measurement and the distur- 
bance on the original state. More precisely, the opti- 
mal POVM is given by the convex combination of the 
minimum-error POVM and the completely uninforma- 
tive one, whereas the measurement instrument is given 
by the coherent superposition of the minimum-disturbing 
instrument for the optimal discrimination and the iden- 
tity map. 

We finally suggest two possible experimental realiza- 
tions of the minimum-disturbing measurement, whose 
details will be published elsewhere [2l[. Since we are 
interested not only in the success probability but also 
in the posterior state of the system after the measure- 
ment, we have to focus on a possible indirect measure- 
ment scheme, in which the system is made interact with 
a probe, in such a way they get entangled. After such 
interaction takes place, a projective measurement is per- 
formed on the probe. The mathematical parameter t con- 
trolling the tradeoff in Eq. (|15[) can then be put in cor- 
respondence with a physical parameter controlling the 
strength of the interaction between the system and the 
probe: t — means that the interaction is actually fac- 
torizcd in such a way that the following measurement on 
the probe does not provide any information about the 
system and the system is completely unaffected by the 
probe's measurement, that is, the no-measurement case. 



On the contrary, t = 1 identifies a completely entangling 
interaction, or, in other words, a situation in which a 
measurement on the probe gives the largest amount of 
information about the system, consequently causing the 
largest disturbance. Two possible schemes for two-level 
systems encoded on photons satisfy our requirements, 
that is, an entangling interaction produced by means of a 
non-linear Kerr medium [22j, or an entangling measure- 
ment realized as a parity check [l6[. The first approach, 
even if deterministic — i. e. no events have to be discarded 
in principle — has serious drawbacks in reaching the value 
t = 1, since too large Kerr nonlinearity is needed [22j ■ On 
the other hand, the second approach is probabilistic — one 
half of the events are discarded — but it is based just on 
linear optics and it has been already implemented and 
successfully tested [l6j . 
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